22 research outputs found

    CNN-LSTM Architecture for Action Recognition in Videos

    Action recognition in videos is currently a topic of interest in the area of computer vision, due to potential applications such as multimedia indexing and surveillance in public spaces, among others. In this paper we propose a CNN-LSTM architecture. First, a pre-trained VGG16 convolutional neural network extracts the features of the input video. Then, an LSTM classifies the video into a particular class. To carry out training and testing, we used the UCF-11 dataset. We evaluate the performance of our system using accuracy as the evaluation metric. Applying leave-one-out cross-validation (LOOCV) with k = 25, we obtain approximately 98% accuracy for training and 91% for testing. Sociedad Argentina de Informática e Investigación Operativa.
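
    The abstract describes the pipeline only at a high level. The following is a minimal Keras sketch of such a VGG16-feature + LSTM classifier, not the authors' code; the clip length, LSTM width and training settings are illustrative assumptions (UCF-11 does have 11 classes).

    from tensorflow.keras import layers, models
    from tensorflow.keras.applications import VGG16

    NUM_FRAMES, NUM_CLASSES = 25, 11                       # assumed clip length; UCF-11 has 11 classes

    # Frozen, ImageNet-pre-trained VGG16 used purely as a per-frame feature extractor.
    backbone = VGG16(weights="imagenet", include_top=False, pooling="avg",
                     input_shape=(224, 224, 3))
    backbone.trainable = False

    model = models.Sequential([
        layers.Input(shape=(NUM_FRAMES, 224, 224, 3)),
        layers.TimeDistributed(backbone),                  # -> (NUM_FRAMES, 512) features per clip
        layers.LSTM(256),                                  # temporal modelling; unit count assumed
        layers.Dense(NUM_CLASSES, activation="softmax"),   # one score per action class
    ])
    model.compile(optimizer="adam", loss="categorical_crossentropy",
                  metrics=["accuracy"])
    model.summary()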

    BiLSTM with CNN Features For HAR in Videos

    Action recognition in videos is currently a topic of interest in the area of computer vision due to its potential applications, such as multimedia indexing and surveillance in public spaces, among others. In this work we propose a CNN-BiLSTM architecture. First, a pre-trained VGG16 convolutional neural network extracts the features of the input video. Then, a BiLSTM classifies the video into a particular class. We evaluate the performance of our system using accuracy as the evaluation metric, obtaining 40.9% and 78.1% for the HMDB-51 and UCF-101 datasets, respectively. Sociedad Argentina de Informática e Investigación Operativa.
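
    The architecture described here differs from the CNN-LSTM entry above mainly in the recurrent layer. A minimal sketch of that change, assuming the same precomputed per-frame VGG16 features and an illustrative unit count:

    from tensorflow.keras import layers

    # Drop-in replacement for the plain LSTM of the earlier sketch: a bidirectional
    # LSTM reads the frame sequence forwards and backwards before classification.
    recurrent = layers.Bidirectional(layers.LSTM(256))     # unit count is an assumption

    Reading the sequence in both directions lets the classifier use temporal context from frames both before and after each time step.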

    CNN-LSTM with a Soft Attention Mechanism for Human Action Recognition in Videos

    Action recognition in videos is currently a topic of interest in the area of computer vision, due to potential applications such as multimedia indexing and surveillance in public spaces, among others. Attention mechanisms have become a very important concept within the deep learning approach; they try to imitate the visual ability of people to focus their attention on relevant parts of a scene in order to extract important information. In this paper we propose a soft attention mechanism adapted to a base CNN-LSTM architecture. First, a VGG16 convolutional neural network extracts the features from the input video. Then an LSTM classifies the video into a particular class. To carry out the training and testing phases, we used the HMDB-51 and UCF-101 datasets. We evaluate the performance of our system using accuracy as the evaluation metric, obtaining 40.7% (base approach) and 51.2% (with attention) for HMDB-51, and 75.8% (base approach) and 87.2% (with attention) for UCF-101.
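
    The following sketch illustrates one common form of soft attention over precomputed per-frame CNN features: a small scoring network produces a weight per time step, and a weighted sum of the LSTM hidden states feeds the classifier. It shows the generic mechanism, not necessarily the paper's exact formulation; all shapes and layer sizes are assumptions.

    import tensorflow as tf
    from tensorflow.keras import layers, models

    NUM_FRAMES, FEAT_DIM, NUM_CLASSES = 25, 512, 51        # illustrative; HMDB-51 has 51 classes

    frames = layers.Input(shape=(NUM_FRAMES, FEAT_DIM))    # precomputed per-frame VGG16 features
    hidden = layers.LSTM(256, return_sequences=True)(frames)   # one hidden state per frame

    # Soft attention: score each time step, normalise the scores with a softmax,
    # and pool the hidden states with the resulting weights.
    scores  = layers.Dense(1)(hidden)                       # (batch, T, 1)
    weights = layers.Softmax(axis=1)(scores)                # attention weights over time
    context = layers.Lambda(
        lambda t: tf.reduce_sum(t[0] * t[1], axis=1)        # weighted sum -> (batch, 256)
    )([weights, hidden])

    outputs = layers.Dense(NUM_CLASSES, activation="softmax")(context)
    model = models.Model(frames, outputs)
    model.compile(optimizer="adam", loss="categorical_crossentropy", metrics=["accuracy"])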

    Stereo Parallel Tracking and Mapping for Robot Localization

    This paper describes a visual SLAM system based on stereo cameras and focused on real-time localization for mobile robots. To achieve this, it heavily exploits the parallel nature of the SLAM problem, separating the time-constrained pose estimation from less pressing matters such as map building and refinement tasks. On the other hand, the stereo setting makes it possible to reconstruct a metric 3D map for each frame of stereo images, improving the accuracy of the mapping process with respect to monocular SLAM and avoiding the well-known bootstrapping problem. Also, the real scale of the environment is an essential feature for robots that have to interact with their surrounding workspace. A series of experiments, performed both on-line on a robot and off-line with public datasets, validates the accuracy and real-time performance of the developed method.
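
    The parallel structure described above can be pictured as two loops sharing a map: a time-critical tracking loop and a background mapping loop. The following is only a schematic Python sketch of that split; all class and method names (grab_stereo_pair, localize, add_stereo_keyframe, refine, etc.) are placeholders, not the actual API of the system.

    import queue
    import threading

    keyframe_queue = queue.Queue()

    def tracking_loop(camera, slam_map, pose_publisher):
        """Time-constrained: estimate the camera pose for every incoming stereo frame."""
        while True:
            left, right = camera.grab_stereo_pair()
            pose = slam_map.localize(left, right)           # match features, solve the pose
            pose_publisher.publish(pose)
            if slam_map.needs_keyframe(pose):
                keyframe_queue.put((left, right, pose))     # defer heavy work to the mapper

    def mapping_loop(slam_map):
        """Less pressing: triangulate metric 3D points from stereo pairs and refine the map."""
        while True:
            left, right, pose = keyframe_queue.get()
            slam_map.add_stereo_keyframe(left, right, pose)
            slam_map.refine()                               # e.g. local bundle adjustment

    def run(camera, slam_map, pose_publisher):
        # Mapping runs in the background so it never blocks pose estimation.
        threading.Thread(target=mapping_loop, args=(slam_map,), daemon=True).start()
        tracking_loop(camera, slam_map, pose_publisher)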

    Human Action Recognition in Videos Using a Robust CNN-LSTM Neural Network

    Action recognition in videos is currently a topic of interest in the area of computer vision, due to potential applications such as multimedia indexing and surveillance in public spaces, among others. In this paper we propose: (1) the implementation of a CNN-LSTM architecture, in which a pre-trained VGG16 convolutional neural network first extracts the features of the input video and an LSTM then classifies the sequence into a particular class; (2) a study of how the number of LSTM units affects the performance of the system; and (3) an evaluation of the performance of our system using accuracy as the evaluation metric, given the existing class balance in the datasets. To carry out the training and test phases, we used the KTH, UCF-11 and HMDB-51 datasets, obtaining 93%, 91% and 47% accuracy respectively and improving state-of-the-art results for the first two. Beyond these results, the main contribution of this work lies in the evaluation of different CNN-LSTM architectures for the action recognition task.
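
    Point (2) amounts to sweeping the width of the recurrent layer and comparing accuracies. A minimal sketch, assuming precomputed per-frame VGG16 features and an illustrative set of unit counts (the values actually studied are not given in the abstract):

    from tensorflow.keras import layers, models

    NUM_FRAMES, FEAT_DIM, NUM_CLASSES = 25, 512, 6          # illustrative; e.g. KTH has 6 classes

    def build_model(lstm_units):
        # Same pipeline as in the earlier sketch, parameterised by the number of LSTM units.
        return models.Sequential([
            layers.Input(shape=(NUM_FRAMES, FEAT_DIM)),     # precomputed VGG16 features
            layers.LSTM(lstm_units),
            layers.Dense(NUM_CLASSES, activation="softmax"),
        ])

    for units in (64, 128, 256, 512):                       # assumed sweep, not the paper's values
        model = build_model(units)
        model.compile(optimizer="adam", loss="categorical_crossentropy", metrics=["accuracy"])
        # model.fit(x_train, y_train, validation_data=(x_val, y_val), epochs=10)
        # would then be compared across unit counts on validation accuracy.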

    Models for Synthetic Aperture Radar Image Analysis

    After reviewing some classical statistical hypotheses commonly used in image processing and analysis, this paper presents some models that are useful in synthetic aperture radar (SAR) image analysis.

    Polarimetric SAR Image Segmentation with B-Splines and a New Statistical Model

    We present an approach for polarimetric Synthetic Aperture Radar (SAR) image region boundary detection based on the use of B-spline active contours and a new model for polarimetric SAR data: the GHP distribution. In order to detect the boundary of a region, initial B-spline curves are specified, either automatically or manually, and the proposed algorithm uses a deformable-contour technique to find the boundary. In doing so, the parameters of the polarimetric GHP model for the data are estimated in order to find the transition points between the region being segmented and the surrounding area. This is a local algorithm, since it works only on the region to be segmented. Results assessing its performance are presented.
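
    For the B-spline contour representation the method builds on, a small SciPy sketch: a closed cubic B-spline is fitted through a handful of boundary points and evaluated densely, giving the curve that a deformable-contour step would then push toward the detected transition points. The points and smoothing value are illustrative; this is not the paper's algorithm.

    import numpy as np
    from scipy.interpolate import splprep, splev

    # Rough initial boundary points around a region (specified automatically or manually).
    theta = np.linspace(0.0, 2.0 * np.pi, 12, endpoint=False)
    x = 100 + 40 * np.cos(theta) + np.random.normal(0, 2, theta.size)
    y = 120 + 30 * np.sin(theta) + np.random.normal(0, 2, theta.size)

    # Fit a periodic (closed) cubic B-spline through the points.
    tck, u = splprep([x, y], s=5.0, per=True)

    # Evaluate a dense, smooth contour from the spline; this is the curve that the
    # deformable-contour step would iteratively update.
    u_dense = np.linspace(0.0, 1.0, 200)
    xs, ys = splev(u_dense, tck)
    boundary = np.stack([xs, ys], axis=1)                   # (200, 2) contour points
    print(boundary.shape)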

    New families of polarimetric distributions for SAR images

    This thesis presents the derivation of a new distribution for polarimetric Synthetic Aperture Radar (SAR) imagery. The distribution is based on the multiplicative model, assuming a multivariate complex Wishart law for the speckle and an inverse Gaussian law for the backscatter. From this proposal, the harmonic polarimetric distribution is obtained and, as a particular case, the harmonic distributions for intensity and amplitude data. Moments-based estimators for the parameters that index these distributions are derived and assessed. It is shown that extracting these parameters as features is a way of augmenting the information content and the discriminating power in SAR image classification. Fil: Jacobo Berlles, Julio C. A. (Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Argentina).
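
    In the notation one would typically use for this construction (the symbols below are assumptions, not taken from the thesis), the multiplicative model combines the two laws mentioned in the abstract as follows:

    % Multiplicative model: observed polarimetric return = backscatter x speckle.
    \[
      Z = \tau\, W, \qquad
      \tau \sim \mathrm{IG}(\omega, \eta) \quad \text{(inverse Gaussian backscatter)}, \qquad
      W \sim \mathcal{W}_{\mathbb{C}}(n, \Sigma) \quad \text{(complex Wishart speckle)} .
    \]
    % The density of the resulting (harmonic) polarimetric distribution follows by
    % integrating the conditional density over the backscatter law:
    \[
      f_Z(z) = \int_{0}^{\infty} f_{Z \mid \tau}(z \mid \tau)\, f_{\tau}(\tau)\, d\tau .
    \]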